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Abstract. We study the entropy of the distribution of the set R„ of vertices 
visited by a simple random walk on a graph with bounded degrees in its first 
n steps. It is shown that this quantity grows linearly in the expected size 
of R n if the graph is uniformly transient, and sublinearly in the expected 
size if the graph is uniformly recurrent with subexponential volume growth. 
This in particular answers a question asked by Benjamini, Kozma, Yadin and 
Yehudayoff [J. 



1. Introduction 

Let {X n ) n > be a simple random walk on a graph G — (V, E) starting at some 
vertex o e V. The entropy of the distribution of X n for large n has been studied 
on Cayley graphs and is known to be related to other objects of interest, such as 
the rate of escape and the existence of non-trivial bounded harmonic functions, see 
03, [S], [ID]- This work is devoted to the entropy of a similar observable, to that of 
the range of the random walk. Let R n = {Xq, Xx,..., X n } be the set of vertices 
visited by the random walk in its first n steps. The entropy of R n is defined as 



(1.1) H {R n ) = E 



log 



Po{Rn 



where the random walk starts at o G V, P -&.s., p (A) — P [R n — A] for any 
finite connected set A C V containing o, and 01og(l/0) is defined to be as 
usual. According to Shannon's noiseless coding theorem [3J, H (R n ) /log 2 can 
be interpreted as the approximate number of 0-1-bits required per realization in 
order to encode a large number of independent realizations of R n with negligible 
probability of error. Up to an additive constant, H (R n )/ log 2 can also be viewed 
as the expected number of bits necessary and sufficient to encode R n (see [3J, 
Theorem 5.4.1), and as the expected number of fair coin tosses required for the 
simulation of R n (see [3J, Theorem 5.12.3). 

The entropy of i?„ for random walk on Z d , d > 1, is investigated in a recent work 
by Benjamini, Kozma, Yadin and Yehudayoff l a . There, the authors show that the 

l 



large n behavior of H (R n ) is of order n for d > 3, of order nj log 2 n for d = 2 and 
of order logn for d — 1. Comparing this behavior with that of the expected size 
(Rn)o of Rni 

(1.2) (i?„)o = £ [K|], 

known to be of order n for d > 3, of order nj logn for d — 2 and of order -^/n for 
d = 1 (cf. jS], p. 221) , we observe that on Z d , H (R n ) grows linearly in (R n }o in the 
transient case and only sublinearly in the recurrent case. On general graphs with 
bounded degrees, H (R n ) can grow at most linearly in {R n ) (see Proposition ^. 4|) . 
but the assumptions on recurrence and transience alone do not allow us to conclu- 
sively compare H (R n ) with (R n )o- Under slightly stronger assumptions, however, 
we can generalize the observation just made for Z d to large classes of graphs. 

The graphs we consider in this work are characterized by strengthened transience 
and recurrence conditions. Recall that a graph is transient if the escape probability 
P [o $l {Xl,X2, . . .}] is strictly positive for any starting vertex o € V. We call a 
graph G uniformly transient if such an estimate holds uniformly in o, that is, if 

(1.3) inf P [o^{Xx,X 2 ,...}} >0. 

Theorem 11.11 shows that H a (R n ) grows linearly in {R n ) on uniformly transient 
graphs with bounded degrees. Such a statement does not hold under the assumption 
of transience alone, see Remark 13.51 for a counterexample. 



Theorem 1.1. Let G be a uniformly transient graph with bounded degrees. Then 

(1.4) Urn inf inf H ° {Rn) > 0. 

n->oo oEV (R n )o 

On the other hand, we call a graph G uniformly recurrent if 

(1.5) sup P [o £ {Xi, . . . ,X n }] — > 0, as n -> oo. 

In addition, let \d e B(x,r)\ be the number of vertices at distance r + 1 from iey 
with respect to the usual graph distance. If the degrees of the graph are bounded 
by d > 2, then \d e B(x,r)\ < d r+1 . The following condition requires slightly more: 

(1.6) for any e > 0, sup \d e B(x, r)\ < e er , for infinitely many r € N. 

Under these conditions, we prove that H {R n ) grows only sublinearly in {R n ) . We 
note that recurrence alone does not imply such a statement, see Remark 14.11 and 
also Remark 15.41 

Theorem 1.2. Let G be any uniformly recurrent graph with bounded degrees sat- 
isfying (|1.6|) . Then 



n ^ H (R n ) 

(1.7) sup > 0, as n — > oo. 

oeV \Rn)o 

The above results apply in particular to all vertex-transitive graphs. Recall that 
a graph G is vertex-transitive, if for every pair of vertices (x, x'), there is a bijection 
cj) from V to V such that (j>{x) = x' and d(y,y') — d{<j>{y),(f>{y')) for all y,y' G V, 
where d(., .) denotes the usual graph distance. For vertex-transitive graphs, H (R n ) 
and (R n )o do not depend on the starting vertex o of the random walk, so we omit o 
from the notation. We can deduce the following dichotomy from the results above: 
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Theorem 1.3. Let G be a vertex-transitive graph. 



(1.8) 



If G is transient, then lim inf 



gCgn) 



> 



(1.9) 



i/ G is recurrent, then 




0, as n — > oo. 



For uniformly transient graphs, (R n ) grows linearly in n (see Proposition 15.21) . 
Hence, 1)1.80 in particular shows that H(R n ) grows linearly in n on all vertex- 
transitive and transient graphs, thus answering a question asked in [I], Section 3.3. 

We now comment on the proofs, starting with the one of Theorem 11.11 given 
in Section [3] A simple observation made in [I] shows that H (R n ) is bounded 
from below by a constant times the expected size of the interior boundary diR n 
of R n - It thus suffices to prove a lower bound of order (Rn} on _E [|<9ii?„|], or, 
equivalently, to prove that the fraction of the visited vertices belonging to &iR n is 
non-degenerate. Note that a vertex belongs to diR n if it is visited by the random 
walk, but at least one of its neighbors is not. Not all visited vertices are equally 
likely to belong to diR n . Indeed, a vertex in the middle of a long path of vertices 
of degree 2 is very unlikely to end up in the boundary of R n , because the random 
walk typically returns many times and visits both of its neighbors before escaping 
from it. In order to avoid such situations, we observe that on uniformly transient 
graphs, vertices of degree at least 3 exist within a bounded distance of every vertex 
(cf. Lemma 13.30 . and use uniform transience to prove that whenever the random 
walk visits such a vertex x, there is a non-degenerate probability that the walk then 
escapes from the ball B(x, 1) of radius 1 around it and leaves x in diR n . This yields 
the lower bound on _E [|<9ii?„|] required to prove Theorem ll.il In order to show that 
transience alone is not a sufficient assumption for the conclusion of Theorem 11.11 
we consider a binary tree, where every edge at depth I is replaced by a path of 
length 1 + 3, and show that this graph satisfies (|1.70 . see Remark 1 3. 5 1 and Figured] 

In the proof of Theorem 11.21 in Section 2) we cover every possible realization of 
R n with one of at most sup^gy \d e B(x, r )|( const -)l-RnlA different collections of balls 
of radius r > 1 . Uniform recurrence implies that most of these balls are typically 
completely covered by R n . We then consider the conditional entropy of R n , given 
the number K of such balls intersected by R n and the number L of balls that are 
not completely filled by R n (the definition of conditional entropy is recalled in (12.40 
below). The conditional entropy can then be bounded by the expected logarithm 
of the number of possible configurations R n can belong to, given (K, L) (cf. 1)2.71) 1. 
By assumption (|1.6|) . the expected logarithm of the number of possible collections 
of covering balls can be made smaller than e(R n } for any fixed e > by choosing r 
appropriately, while the assumption of uniform recurrence yields a similar bound on 
the expected logarithm of the number of possible choices of the exact configurations 
of R n in the few unfilled balls. This argument proves the required estimate on the 
conditional entropy of R n given (K, L). Since the entropy of (K, L) itself is only of 
order at most logn, this is sufficient for Theorem ll.2l Theorem ll.2l cannot be proved 
for every recurrent graph. As a counterexample, we consider a sequence of finite 
binary trees with rapidly increasing depths, connected together by a half-infinite 
path, see Remark HTD and Figured) 
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The article is organized as follows: In Section [51 we introduce notation and 
preliminary results on entropy, derive a general lower bound on (R n ) and show 
that H a (R n ) grows at most linearly in (R n } . Section [3] contains the proof of 
Theorem 1 1 . 1 1 and Section [4] proves Theorem II .21 Finally, the dichotomy for vertex- 
transitive graphs in Theorem 11.31 is deduced in Section [5l 

Acknowledgement. The author is grateful to Itai Benjamini for helpful con- 
versations and to an anonymous referee for helpful remarks. 

2. Notation and preliminaries 

In this section, we introduce the notation and prove some preliminary results. 
These include a lower bound of order y/n on the expected size of R n on a general 
infinite graph with bounded degrees, obtained in Proposition 12.31 by adapting an 
argument in [7], as well as the observation that H {R n ) grows at most as fast as 
the expected size (R n )o on any infinite connected graph with bounded degrees. 

Throughout this article, we let G — (V, E) be a graph. V denotes the set 
of vertices and E the set of edges consisting of unordered pairs of vertices in V . 
Whenever {x, y} G E, we write x ~ y and call the vertices x and y neighbors. The 
number of neighbors of a vertex x is always assumed to be finite and referred to as 
the degree of x, denoted deg(x). A path of length / is a sequence (xo, x\, . . . , x{) of 
vertices in V such that Xi ~ Xi+i for < i < I — 1. The distance d{x, y) between 
any two vertices x and y is defined as the length of the shortest path starting at 
xq = x and ending at x d ( xy j = y, and defined to be oo if no such path exists. G 
is said to be connected if d(x, y) < oo for all x, y € V. For any x G V, r > 0, we 
define the ball B(x, r) = {y E V : d(x, y) < r}. The graph G has bounded degrees 
if sup xeV deg(x) < d for some d > 1. For any set A of vertices, we define the 
interior and exterior boundaries of A by diA = {x £ A : x ~ y for some y € V \ A} 
and d e A = d l (V\A). The subgraph G(A) = (A, E(A)) induced hy AQV consists 
of the vertices in A and the set of edges E(A) = {{x, y} E E : {x, y} C A}. We say 
that A is connected if G{A) is connected in the above sense. The cardinality of any 
set A is denoted by \A\, the largest integer less than a E M by [a] and the minimum 
of numbers a, b E K by a A b. Throughout the text, c or c' are used to denote 
strictly positive constants with values changing from place to place. Dependence 
of constants on additional parameters appears in the notation. For example, Cd 
denotes a positive constant possibly depending on d. 

For any x E V, we denote by P x the law on V 71 (equipped with the canonical a- 
algebra generated by the coordinate projections) of the Markov chain on V starting 
at x with transition probability 

, \_ / l/deg(x), if x ~ y, 
P\ x i V) — | o, otherwise, 

and write (X n ) n >o for the canonical coordinate process, referred to as the simple 
random walk on G. The corresponding expectation is denoted by E x , the canonical 
shift-operators on V n by (0 n ) n >o- Note that deg(x)p(x, y) — deg(y)p(y, x), meaning 
that the measure it : A h->- ^2 xeA deg(x) on V is a reversible measure for the simple 
random walk. The first entrance and hitting times of a set A of vertices are defined 
as 

(2.1) t a = mi{n > : X n E A}, r+ = inf{n > 1 : X n E A}, 
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where we write t x rather than ts x \ if A consists of a single element x. The capacity 
of a finite nonempty subset A of V is defined as 

(2.2) cap(A) = p xH = TO 1 deg(«)- 

We will generally refer to the starting vertex of the random walk as o. It will be 
convenient to define the collection C as 

(2.3) C = {A C V : A is finite, connected, and o G A}. 

Note that i?„ G C Q , P Q -a.s. The entropy of the range R n of the random walk has 
been defined in (11.11) . More generally, for any random variable X taking values in 
a countable set X, we define the entropy of X by 



H (X) = E 



log 



Po{X) 



where p {x) = P D [X = x], for x € X. 



Given another random variable Y taking values in a countable set y, the conditional 
entropy of X given Y is defined by 

(2.4) H (X\Y) = H ((X,Y)) - H (Y). 
An application of Jensen's inequality shows that 

(2.5) H {X) <\og\X\, 
while the following estimate is elementary: 

(2.6) H {X) < H (X\Y) + H {Y). 
Moreover, we have the following useful lemma: 

Lemma 2.1. For random variables X and Y with values in countable sets X and 

y, 

(2.7) H {X\Y) < E [\og\X Y \], where 

X y = {x g X : P [(X,Y) = {x,y)\ > 0}, for yey. 

Proof. Using Jensen's inequality, we have 

P o [(X,Y)=(x,y)]>0 

- E PoW - y] E Po[x = x \Y = y\ log ( Po[x= ^ Y } 

yey xex v v ° L 1 yi 

< E [\o g \x Y \}. a 

The following simple lemma, combined with a covering argument, will be instru- 
mental in proving the bound on H a (R n ) in Theorem 11.21 

Lemma 2.2. Let G = (V, E) be any graph with bounded degrees, o G V, and let 
A G C (cf. (|2.3[) ). Then there exists a nearest-neighbor path in G(A) starting at o, 
covering A, and of length at most 2\A\. 
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Proof. The standard depth- first search algorithm (see [5]) yields a spanning tree 
Ta = (A, Et) of G(A) (i.e. a tree with vertices A and edges Et Q E(A)), as well as 
a nearest-neighbor path in Ta covering A and traversing every edge in Et at most 
once in every direction. Since the length of such a path is bounded from above by 
2\E T \ = 2(\A\ - 1) < 2\A\, the statement follows. □ 

We now prove a general lower bound on the expected range of random walk on 
an infinite connected graph with bounded degrees. This lemma will allow us to 
disregard small errors when relating H (R n ) to (R n ) - 

Proposition 2.3. Let G = (V,E) be any infinite connected graph with bounded 
degrees. Then 

(2.8) liminf inf > 0. 

n— >oo oev ■Jn 

Proof. This proof is an adaptation of an argument appearing in \7\ in the context 
of random walk on Cayley graphs. Let (Nk)k>i be the successive times when the 
random walk visits a new vertex, that is, N% — 0, and for k > 1, Nk — inf{n > 
Nk-i : X n ^ {X , . . . Xx k _ 1 }}. Then we have for k > 1 and o € V, by the 
Chebychev inequality, 

Po[\R n \ < k] = P o \\{0 < I < n : Xi e {X Nl ,. . .,X Nk }}\ = n + 1 



< 

12 ' ' l<i<k L 0<l<n 

Noting that Xi ^ X^ i for I < N% and applying the strong Markov property at time 
Ni, we deduce that 

Po[\Rn\ <k}<- J2 SU P = 4 

11 0<l<n x£V 



By a general heat-kernel upper bound, we have sup x yl£V P x [Xi = y] < Cd/vT+T, 
for some constant Cd > depending on the uniform bound d on degrees (see, for 
example, [TT], Corollary 14.6, p. 149). Hence, we find 

Po[\Rn\ <k]< c d -^=, for k > 1. 

This last inequality, applied with k — [y/n/ (2cd)}, yields 

Eo[\Rn\] > ^ P o[\Rn\ > W^/{2c d )]] > ^. □ 

Finally, we prove that on any infinite connected graph with bounded degrees, 
H Q (R n ) does not grow faster than (R n )o- 

Proposition 2.4. Let G — (V, E) be any infinite connected graph with degrees 
bounded by d. Then 

H (R n ) 
hm sup sup — < 2 log a. 

n— >oo oGV \Rn)o 



Proof. Lemma 12.11 and Lemma 12.21 together imply that 

H (R n \\R n \) < (R n ) 2 toed, 

whereas it follows from ()2.5[) that 

H (\Rn\) <log(n + l). 

Hence by (|2.6|) . H (R n ) < (R„) 2 logd + \og(n + 1). Proposition 12.31 completes the 
proof. □ 

3. The transient case 

In this section, we prove Theorem 11.11 asserting that the entropy of R n grows at 
least linearly in its expected size (R n )o on any uniformly transient graph. 

Lemma [3.1 1 and Corollary 13.21 show that H (R„) grows at least linearly in the 
expected size (diR n ) of diR n . These two statements and their proofs are straight- 
forward adaptations of Lemma 3 and Corollary 4 in pQ . 

Lemma 3.1. For any graph G with degrees bounded by d > 2, 

(3.1) P [R n = A] < (1 - d- 1 ) 10 ^ 1 - 1 , for alloeACV. 

Proof. Fix A C V and define the successive entrance times to di A by T\ = rg iA 
and for k > 2, T k = inf{n > T fc _i : X n G d l A}. Then {R n = A} C {T\g iA \ < n} C 
{T|a (i 4|-i < n }- Note that on the event {Tk < oo}, we have Px Tk [XT k +i & A] < 
1 — (1/d). An inductive application of the strong Markov property at the times 
T\g iA \_ x , T\ d . A \_ 2 , ...,Ti therefore yields 



P [R n = A]<P 



p| {T k <n,X Tk +ieA} 
i<fc<|a 4 A|-i 



lx|a,A|-i 

< !--). □ 



Corollary 3.2. For any graph G with degrees bounded by d > 2, 
(3.2) H (R n ) > ({diR n } - 1) log((l - d~ 

where (d l R n ) = E [\diR„ 
Proof. By Lemma |3. II 

H (R n ) > log((l - d- 1 )- 1 ) P i R n = A](\diA\ - 1). L 

Aec 



By the last corollary, it suffices to control (diR n ) in order to prove a lower 
bound on H (R n ). We will eventually prove that many vertices of degree at least 
3 typically belong to diR n . To this end, we prove in the following lemma that such 
vertices exist within constant distance of any vertex in uniformly transient graphs. 
For notational convenience, we introduce the parameter 

(3.3) a= mi 'P x [r+ = oo], 

which is strictly positive if G is uniformly transient (cf. (jl.3p ). 

Lemma 3.3. Let G be a uniformly transient graph with a as in (|3.3[) . Then for 
any x V , there is a vertex y with deg(y) > 3 and d(x, y) < l/a. 
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Proof. Let x £ V, and let r be the distance from x to the vertex of degree at least 
3 closest to x. Then 

(3.4) a < P x [t+ = oo] < P x [Td lB (x,r) < r+] 

= X j r^Py[ T diB(x.r) < T x ]. 

^ degia; ^ ' 

y.y^x 

Consider any neighbor y of x and let I y be the set of vertices in B(x, r) \ diB(x, r) 
that are connected to y in (B(x,r) \ diB(x,r)) \ {x}. Then Iy is connected and 
consists of vertices of degree at most 2. If no vertex in I y is a neighbor of a vertex 
in diB(x,r), then the escape probability on the right-hand side of (|3.4p equals 0. 
Otherwise, the escape probability is equal to the probability that a simple random 
walk on Z started at 1 reaches r before 0, hence to 1/r (see, for example, [4], 
Chapter 3, Example 1.5, p. 179). It follows that a < 1/r. □ 

We will also require the following simple consequence of monotonicity of the 
capacity (cf. (I2~2l0 : 



Lemma 3.4. Let G be a uniformly transient graph. Then 

(3.5) inf capM) > a. 

ACV,A^ 

Proof. By uniform transience, cap({x}) > adeg(x) > a for all x £ V (see (|1.3j) ). 
The statement (|3.5p thus follows immediately from monotonicity of cap: 

(3.6) for finite, non-empty sets A C B C V, cap(A) < cap(£>). 

Here is a direct proof of this well-known property: by summing over all possible 
times n and locations y of the last visit of the random walk to the set B and using 
the simple Markov property at time n, we find that, for A, B as above, 



(3.7) cap(A) = V V V P x [t+ > n, X n = y, T +o0 n = oo] deg(a;) 



x£diAn=Oy£diB 



= P ^ t a > n ' x » = y] p vl T B = oo] deg(a;). 

x£diAn=O y ediB 

Reversibility of the simple random walk implies that 

deg(x)P x [r^ > n,X n = y] = deg(y)P y [T A =n,X n = x], 
hence by (|3.7p that 

cap(A) = ^2 deg{y)P y {T A < oo]P y [r+ = oo] < cap( J B), 
yediB 

proving (|3.6[) . □ 

We now come to the main objective of this section. 
Proof of Theorem We will prove that 

(3.8) liminf inf ^ffi" > 0. 

n-»oo o£V (R n ) 



The desired statement then follows by Corollary 13.21 and the fact that 

inf (R n ) -> oo, 

o£ V 
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proved in Proposition 12.31 (indeed, all connected components of a transient graph 
are infinite). Denote the set of vertices with degree at least 3 by V>3 and the 
uniform bound on the degrees by d. Observe that any vertex x in V>3 belongs to 
diR n if it is visited by the random walk, but at least one of its neighbors is not. 
Hence, for any x G V>3, the probability of {x £ diR n } is bounded from below by the 
probability that the random walk first reaches B{x, 1) at some vertex y ~ x, then 
follows the path (y, x, z), where z is the neighbor of x maximizing Pz\ t b~( x i) = 
and does not return to B(x, 1) until time n. By the strong Markov property applied 
at time tb( x ,i) + 2 and at time tbi x ,x)i we hence obtain that for any o € V, 

(3.9) P [xed l R„}> 

Po[r B (x,i)<n-2,(X 1 ,X 2 )o9 TB(xA) = (x, z), r^ x ^ o f 



1 a 

> Po[tb(x,x) < n - *A-ip-ipi for an y x e v >s, 



where we have applied Lemma 13.41 with A = B(x, 1) in order to bound the escape 
probability Pz\ t b~! x i) = °°] from below by a/d 2 . For any r > 1, the random walk 
reaches B(x, 1) if it enters B(x,r) and then follows the shortest path to B(x, 1), 
hence, 

Po[tb(x,i) <n-2}> P \T B { x ,r) < n - 1 - r]d -(r_1) . 
Using this estimate in (j33J), we obtain 

(3.10) (diRn)o> Po[x^d t R n ] 

x£V> 3 

> ) , Po[r B (x,r) < n 



We now fix r = [1/a] + 1 and note that, by Lemma [3.31 U x ^v >3 B(x,r) = V. In 
particular, the random walk is in one of the balls B(x,r), x £ V>3, at every step, 
and therefore visits at least sup^, \B(x, r)\] of them in its first k > steps. 

Hence, we can deduce from (|3.10p that 



(diR n )o > 



\sup xeV \B{x,r)\ J d? 



(R n ) -1-r \ a 
> JTTZ 1 -T^X7> for an y ° e v - 



d l+r J d 3+r > 

Using again that inf oe y(i?„) — > oo by Proposition 12.31 we obtain (I3.8[) . □ 



Remark 3.5. The conclusion (|1 .4[) of Theorem l 1 . 1 I does not hold for every transient 
graph with bounded degrees. For a counterexample, consider a binary tree Tb = 
(Vb, Eb) and denote by Si = {x € Vb : dT 6 (o, x) — 1} the set of vertices at distance I 
from the root. We now split every edge connecting a vertex in Si-i to a vertex in 
Si into a path of length 1 + 2, for I > 1, and thereby obtain the stretched tree G = 
(V, -E), illustrated in Figure[TJ Let now (ct„)„>i be the successive displacements of 
the random walk in Vb, that is, <j\ = Ty b and for i > 2, cr, = inf{rt > <Zj_i : X n G 
ry b }. Then elementary computations using the simple Markov property at time 1 
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Figure 1. The stretched binary tree constructed in Remark 13.51 The 
filled vertices are the ones present in the original binary tree 
Tb, the stretched tree G is obtained by adding the unfilled 
ones. 



(cf. [3], Chapter 4, Example 7.1, p. 271-272) show that for any vertex x in Si, 

2 



(3.11) 



P x [d Tb {X ail o) >l} = 



> — > for I > 1. 
-10 2' 



2+ (Z + 3)/(Z + 2) 

Hence, the process (X ak )k>i is transient. In particular, the stretched graph G 
remains transient. Denote by R' n — R n n Vj, the intersection of R n with the vertices 
in the original tree. Then from (|2.6|) , we obtain that 

(3.12) H (R n ) < H (R n \R' n ) + H {R' n ). 

For any fixed realization of R' n of diameter to with respect to the original tree Tt, 
the set R n is determined up to the precise location of at most 2\R' n \ boundary 
vertices on paths of length at most m + 3. Hence, for any such realization, there 
are at most (to + S) 2 '^™' < (c|i? n |) 2 ' fl "' choices for R n . It therefore follows from 
Lemma 12.11 that 

H (R n \R' n ) < cE [\R' n \]logn. 

The amount of time the process X a spends in any given set Sk is by (|3.11[) and an 
elementary estimate on biased random walk stochastically dominated by a geomet- 
rically distributed random variable with success probability 1/5 (cf. IT, Chapter 4, 
Example 7.1 (b), p. 271-272). It follows that 



(3.13) 



Eo\\K\} <l + 5E 



E 

Kfc<T 



{TS fc <«} 



< cEr. 



max rfto,^) 1 / 2 <cE [^/\R^\]. 

0<k<n J 



Applying Jensen's inequality on the right-hand side, we deduce that 

Ho(Rn \K) <c(Rn)l /2 bg U. 
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Regarding the other term on the right-hand side of (|3.12p . we have by the same 
argument as in the proof of Proposition ^. 4[ 

H a {R' n ) < cE [\R' n \] < c(i?„)y 2 , by @33J. 

Upon inserting the last two estimates into p,12|) . it follows from Proposition 12.31 
that sup og y H (R n )/(R n ) tends to as n — > oo, although G is transient. 

4. The recurrent case 

In this section we prove Theorem II. 2 [ showing that sup og y H (R n )/{R n ) tends 
to zero as n — y oo on uniformly recurrent graphs satisfying the growth condition 
CU). 

Recall that any realization of R n belongs to C , P-a.s. (cf. (|2.3jl ). Our argument 
is based on Lemma l2~2l showing that any set A £ C can be explored by a path not 
longer than 2\A\. We will use this fact in the proof of Theorem ll.2l in order to show 
that any realization of R n can be covered by one of at most sup^gy \d e B(x, r) \ d ' Rn \l r 
different collections of balls of radius r. The second key observation will be that 
due to uniform recurrence, most of these balls of radius r visited are typically 
completely covered by the random walk. This will show that the distribution of R n 
is sufficiently concentrated to admit a bound on H (R n ) sublinear in (R n ) . 

Proof of Theorem We denote the uniform bound on degrees by d. Note that we 
can assume without loss of generality that G is connected, for we may otherwise only 
consider the component of G containing the starting vertex o of the random walk. If 
G is finite, then (| 1 . T[) immediately follows from the fact that lim n P [R n = V] = 1. 
Hence, we can from now on assume that G is connected and infinite. In particular, 
Proposition 12.31 is available for application. 

Consider any o £ V and r > 1. For any A £ C (cf. (|2.3[1 ), we define the finite 
sequence of vertices 

(4.1) F r (A) = (xi,...,x k ) 

such that the balls with radius r centered at x\, . . . , Xk cover A, trying to keep k 
as small as possible. We define Fk(A) inductively as follows: by Lemma T2.21 there 
is a nearest-neighbor path pa — (p{0), ■ ■ ■ ,p{l)) in G(A) starting at o = pa(0) and 
visiting all vertices in A in / < 2\A\ steps. Set x\ = o,t\ = 0, and for i > 2 such 
that < oo, define ti as 

t= { > U-i -PA(t) i B(xi-x,r)}, if Ui<j<i_i B( Xj ,r) ^ A, 

1 1 oo, otherwise, 

and, provided ti < oo, define Xi = pa(U)- Since pa is of length at most 2\A\ and 
covers all of A, this yields a finite sequence as in (|4.1j) , where k is the largest index 
such that tk < oo and satisfies 

(4.2) k<l + [2\A\/r]. 
Finally, the construction implies that 

(4.3) utiB(x,,r)DA. 

We now partition the collection C according to the cardinality of F r {A) and the 
number of vertices x in F r {A) with the property that B(x,r) is not completely 

li 



filled by A: we have C — U^Lj uf =0 C(fc, i), where the disjoint collections C(k, I) are 
defined as 

(4.4) C(k, l) = {AeC Q : \F r (A)\ = k, \{x £ F r (A) : A £ B(x, r)}\ = I}. 
We introduce the random variables K and L, defined as 

(4.5) K=\F r (R n )\, L=\{x&F r (R n ):R n ^B(x,r)}\. 
Observe that since | R n | < n + 1, (14.21) implies that 

0<L<K< [2(n + l)/r] + 1, P -a.s. 

Using the elementary estimate (|2.6p together with (|2.5p and Lemma HOI we obtain 
the following bound on the entropy of R n : 

(4.6) H (R n )<H ((K,L))+H (R n \(K,L)) 

<c d logn + E [log \C(K,L)\]. 

In order to bound the cardinality of C(k, I) for k > 1, I > 0, we denote the maximal 
size of the ball and of the sphere of radius r > 1 by 

(4.7) b r = sup \B{x,r)\ < d 1+r and s r = sup |9 e £f(x,r)|. 

x£V xev 

We now bound the number of choices we have when selecting a set A from the 
collection C(k, I). For a set A to belong to C(k, I), F r (A) must consist of k vertices. 
The initial vertex x\ must be o and each successive vertex must be at distance r + 1 
from the previous one. Hence, F r (A) can take at most sj: different values. For every 
choice of F r (A) = (xi, . . . , Xk), there are at most < 2 k different choices of size-Z 
subsets of vertices Xi with the property that B(xi,r) is not completely filled by A. 
After selecting F r {A) and such a subset, A is by (|4.3|) determined up to the at most 
(2 r ) possible configurations in the balls with centers Xi that are not subsets of A. 
Hence, we have 

(4.8) log 01 < fe(logs r + log2) + Mlog2. 
Inserting this estimate into the above bound (j4.6[) on H a (R n ), we obtain 

(4.9) H (R n ) <E [K](logs r + \og2) + E [L]b r log2 + c d logn. 
By (|4.2I) . F r (R n ) is of cardinality at most [2\R n \/r) + 1, hence 

(4.10) E [K] < -{R n ) + 1. 

r 

We now write P{r,n) = sup xeV P x [R^ n i/^ ^ B(x,r)} and bound the other expec- 
tation on the right-hand side of (14.91) . Since F r (R n ) C _R n , 

(4.11) B„[L] < ^ P [r x < n, i?„ ^ Bfo r)] 

n 

= Z P °{ Tx = m \ p x[Rn- m ~£ B(x,r)] (Markov prop.) 

/™-[« 1/4 ] n \ 

< Y ( £ p °[ Ta; = m i p ( r ' n ) + Z p °^ Tx = m] ) 

£CGV V rn=0 m=n-[n 1 /4] + l / 
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Figure 2. The graph constructed in Remark 14. II 



In order to bound P(r, n), fix any x £ V. Denote the successive return times to x 
by (Tfc)fe>i, that is, T\ = r+, and for k > 2, Tf- = inf{n > Tk-i : X n = x}. Since 
any y £ B(x, r) is within distance r of x, we have P x [t v < Tj] > dr r for any such 
y. With the strong Markov property applied at the times Tfc, we infer that for any 
m > 1, 



(4.12) P{r,n)< sup V 



P x [r v >n 1 ^} 

< sup ^ (P x [t v > T m ] + P x [T m > n 1 

X< ^^ y^B(x.r) 

< sup \B{x, r) | f(l -d- r ) m +mP x [T+ > n 1/4 /m 



Inserting the above estimates into (14.91) . we deduce that 
cb 2 r {\ - d- r ) m + cb 2 r m sup P x 



H (R n ) 21ogs r 
< c 



,1/4 



> 



Cdbr 



{Rn)o 



(Rn)o T X eV 

for any r,m,n > 1 and o £ V. By assumption (|1.5[) and Proposition I2.3[ applied 
to the last two terms, the supremum over o € V of right-hand side converges as n 
tends to infinity to (c(logs r )/r) + cb 2 r (\ — Cd. r ) m ■ Letting m tend to infinity, then 
using (jl.6[> . we obtain (jl.7p . □ 



Remark 4.1. The conclusion (jl.7|l of Theorem 11.21 does not hold for every re- 
current graph with bounded degrees, not even without the supremum over o G V . 
As a counterexample, consider an infinite sequence of binary trees of finite depths 
a\ < 0,2 < ■ ■ • , with roots o = 01,02, . . .. Regardless of the choice of a\, ci2, . . ., after 
addition of edges 

{01, o 2 }, {o 2 , 03},... 

one obtains a connected and recurrent graph, illustrated in Figure [U The sequence 
of depths can be chosen recursively such that 

(4.13) P [r 0n > <l/2 
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(observe that the distribution of r Dji depends on ai, . . . , a n _x, but not on a n ). 
Denote the set of vertices in the n-th tree at distance / from o„ by Si, I > 1. 
Observe that by an elementary estimate on one-dimensional biased random walk, 
whenever the random walk reaches a previously unvisited level Si at some vertex 
x £ Si, the probability that the random walk never returns to Si until time a n 
and thereby leaves x in diR a „ is at least 1/3 (see, for example, [4], Chapter 4, 
Example 7.1, p. 271-272). Hence, 



1 a„ 

KmRaJ] > -^P [r Si <a n ] 



3 <=o 

On 

> P [r Sl < ^/o^\P Si [t Si <a n - y/cQ (Markov prop.) 

2=0 

(by (331) 

z=o 

> c^ Sl [d(o n ,^[a„-vVl^<>«)]- 

Since (c?(o„, Xfc ATon ) — (k A T o„)/3) fc>0 is a martingale (see again [3], p. 272), the 
right-hand side is bounded from below by 

cE Sl [(a„ - ^/a^) A r J > c(a„ - y /a^)Ps 1 [r „ > a„] > c'a„ > c'{R an ), 

using again an elementary estimate on one-dimensional biased random walk for the 
second inequality. Hence, Corollary 13.21 shows that H (R n )/(R n ) does not tend to 
for the recurrent graph G defined above. 

5. Vertex-transitive graphs 

This final section contains the proof of Theorem ll.3l on the dichotomy for vertex- 
transitive graphs. Note that, due to vertex-transitivity, iJ D (i?„), and (R n )o do not 
depend on o, so o can be omitted from the notation. We first deduce the estimate 
in the recurrent case. 

Corollary 5.1. Let G be any vertex-transitive and recurrent graph. Then 

lr i \ H(R n ) 

(5.1) ' — > 0, as n — !• oo. 

\Rn) 

Proof. By Theorem 11.21 we only need to check conditions (| 1 . 5j) and (|1.6[) . Due to 
vertex-transitivity, the suprema become superfluous in both conditions. The asser- 
tion (|1.5[) is thus a consequence of recurrence and monotone convergence. Simple 
random walk on a vertex-transitive graph is quasi-homogeneous, see |llj . Theo- 
rem 4.18, p. 47. As a consequence, if liminf r \B(o, r)|/r 3 > 0, then G satisfies 
a three-dimensional isoperimetric inequality, which implies in particular that the 
random walk is transient (we refer to [TT], Corollary 4.16, p. 47, and Theorem 5.2, 
p. 49, for the proofs of these claims). Hence, we must have liminf r \B(o, r)\/r 3 — 0, 
from which (|1.6j) immediately follows. □ 

Theorem 11.31 follows: 



Proof of Theorem \1.3[ Since the infimum in condition (|1.3p is superfluous for vertex- 
transitive graphs, every vertex-transitive and transient graph is uniformly transient. 
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Theorem 11.11 hence shows that 



liminf H(R n )/(R n ) > 



in the transient case. Corollary 15.11 shows that H(R n )/(R n ) tends to for vertex- 
transitive recurrent graphs. Since every vertex-transitive graph is either transient 
or recurrent, this proves Theorem II .31 □ 

In order to answer the question from 1 mentioned after Theorem ll.31 it remains 
to prove that (R n ) grows linearly in n for vertex-transitive transient graphs. This 
is surely well-known, but we could not find a proof in the literature, so we give a 
proof here. We denote the Green function of the simple random walk (evaluated 
on the diagonal) by 



g{x) = E x 



k=0 



for x € V, 



and the Green function of the random walk killed after n steps by 



k=0 



{X k =x} 



for x G V. 



Proposition 5.2. Let G be any transient graph and let 

e (x) = P x [t+ =oo] G (0,1) 
be the escape probability from vertex x G V . Then 

(Rn)o 



(5.2) 



inf e(o) < liminf inf 



Moreover, if G is vertex-transitive and transient, then 

(Rn) 



(5.3) 



e(o) > 0, as n — > oo. 



Proof. Let G be any transient graph and o G V. By the Markov property applied 
at time 1, g(x) = 1 + (1 — e(x))g(x), hence 



(5.4) 



l(x) = l/e(x), for any x G V. 



For n > 1 and x G V, we denote by JV™ the total number of visits to x in the first 
n steps, 

N x = E 1 {X k =x}- 

0<k<n 

Summation over all vertices yields the total number of steps: 
(5.5) n + l=J2K- 
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Taking expectations in this equation and applying the strong Markov property at 
time t x , we obtain that 

(5.6) n + l=^E [N^] 

< Y, P °{ T * < n M%) 

x£V 

< (R„) supg(x). 

x£V 

The estimate (I5.2[) is trivial if inf oe y e(o) = and follows from (|5.4I) and (15. 6p 
otherwise. 

Let now G be a vertex-transitive transient graph. In particular, (|5.2p holds 
without the infimum on either side. Replacing n by [An] with A > 1 in (|5.5p . we 
have for n > c\, 

[\n] + l>J2 E o[N [ x Xn] l {Tx < n} ] 

x£V 

> £ P o\^ < % [A " ] ~» 

x£V 

= (i? n )oS [Anl -». 

Letting n tend to infinity and using (|5.4[) , it follows that for any A > 1, 

limsup i^Hifi < Ae(o). 

We now let A tend to 1 and together with (|5.2[) obtain (|5.3p . □ 

Remark 5.3. Theorem 11.11 and (|5.2p prove that H a (R n ) grows linearly in n for all 
uniformly transient graphs with bounded degrees, while Theorem 11.21 in particular 
proves that H (R n ) grows sublinearly in n on uniformly recurrent graphs satisfying 
the growth condition (11.61) . 

Remark 5.4. The example presented in Remark |4. II does not satisfy the volume 
growth assumption (|1.6p and hence does not show that (|1.6I) is actually necessary in 
Theorem II .21 We do not know of an example showing necessity of (|1.6[> . or indeed 
if there even exists a graph satisfying (|1.5p but not (11.61) . 

Remark 5.5. Given the results of the present work, it is natural to wonder whether 
one can obtain more precise estimates on H (R n ) on vertex-transitive graphs or even 
prove an analogue of Shannon's theorem on the almost-sure behavior of \og(p (R n ) ) • 
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